NSF PAR Search | NSF Public Access Repository


More than Marketing? On the Information Value of AI Benchmarks for Practitioners

https://doi.org/10.1145/3708359.3712152

Hardy, Amelia; Reuel, Anka; Jafari Meimandi, Kiana; Soder, Lisa; Griffith, Allie; Asmar, Dylan M; Koyejo, Sanmi; Bernstein, Michael S; Kochenderfer, Mykel John (March 2025, International journal of studies on art and humanities)

Free, publicly-accessible full text available March 24, 2026
Active Preference-Based Gaussian Process Regression for Reward Learning and Optimization

Biyik, Erdem; Huynh, Nicolas; Kochenderfer, Mykel; Sadigh, Dorsa (September 2023, The International Journal of Robotics Research (IJRR))

Full Text Available
Optimality Guarantees for Particle Belief Approximation of POMDPs

https://doi.org/10.1613/jair.1.14525

Lim, Michael H.; Becker, Tyler J.; Kochenderfer, Mykel J.; Tomlin, Claire J.; Sunberg, Zachary N. (August 2023, Journal of Artificial Intelligence Research)

Partially observable Markov decision processes (POMDPs) provide a flexible representation for real-world decision and control problems. However, POMDPs are notoriously difficult to solve, especially when the state and observation spaces are continuous or hybrid, which is often the case for physical systems. While recent online sampling-based POMDP algorithms that plan with observation likelihood weighting have shown practical effectiveness, a general theory characterizing the approximation error of the particle filtering techniques that these algorithms use has not previously been proposed. Our main contribution is bounding the error between any POMDP and its corresponding finite sample particle belief MDP (PB-MDP) approximation. This fundamental bridge between PB-MDPs and POMDPs allows us to adapt any sampling-based MDP algorithm to a POMDP by solving the corresponding particle belief MDP, thereby extending the convergence guarantees of the MDP algorithm to the POMDP. Practically, this is implemented by using the particle filter belief transition model as the generative model for the MDP solver. While this requires access to the observation density model from the POMDP, it only increases the transition sampling complexity of the MDP solver by a factor of O(C), where C is the number of particles. Thus, when combined with sparse sampling MDP algorithms, this approach can yield algorithms for POMDPs that have no direct theoretical dependence on the size of the state and observation spaces. In addition to our theoretical contribution, we perform five numerical experiments on benchmark POMDPs to demonstrate that a simple MDP algorithm adapted using PB-MDP approximation, Sparse-PFT, achieves performance competitive with other leading continuous observation POMDP solvers.
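The abstract's idea of using the particle filter belief transition as the generative model for an MDP solver can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the function `pb_mdp_step` and the `toy_*` model functions are hypothetical names, and the one-dimensional Gaussian model is an assumption chosen only to make the example runnable.

```python
import math
import random


def pb_mdp_step(particles, weights, action, transition, sample_obs,
                obs_density, reward, rng):
    """One generative step of a particle belief MDP (illustrative sketch).

    The belief is a list of C states with matching nonnegative weights.
    The four model functions are hypothetical stand-ins for a POMDP model.
    """
    total = sum(weights)
    # Belief reward: weighted average of state rewards under the current belief.
    r = sum(w * reward(s, action) for s, w in zip(particles, weights)) / total
    # Propagate every particle through the transition model: C samples per step.
    next_particles = [transition(s, action, rng) for s in particles]
    # Draw one observation from a particle chosen in proportion to its weight.
    idx = rng.choices(range(len(particles)), weights=weights, k=1)[0]
    obs = sample_obs(next_particles[idx], action, rng)
    # Reweight each propagated particle by the observation likelihood.
    next_weights = [w * obs_density(obs, s2, action)
                    for s2, w in zip(next_particles, weights)]
    return next_particles, next_weights, r


# Toy 1-D model (an assumption for illustration only).
def toy_transition(s, a, rng):
    return s + a + rng.gauss(0.0, 0.1)


def toy_sample_obs(s2, a, rng):
    return s2 + rng.gauss(0.0, 0.5)


def toy_obs_density(o, s2, a):
    z = (o - s2) / 0.5
    return math.exp(-0.5 * z * z) / (0.5 * math.sqrt(2.0 * math.pi))


def toy_reward(s, a):
    return -abs(s)
```

Each call costs C transition samples and C density evaluations, which is where the O(C) factor mentioned in the abstract comes from; an MDP solver would treat the pair of particles and weights as its state.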
Full Text Available
Coordinated Multi-Agent Pathfinding for Drones and Trucks over Road Networks

Choudhury, Shushman; Solovey, Kiril; Kochenderfer, Mykel J.; Pavone, Marco (January 2022, Autonomous agents and multiagent systems)

Full Text Available
A Review of Incident Prediction, Resource Allocation, and Dispatch Models for Emergency Management

https://doi.org/10.1016/j.aap.2021.106501

Mukhopadhyay, Ayan; Pettet, Geoffrey; Vazirizade, Sayyed Mohsen; Lu, Di; Jaimes, Alejandro; Said, Said El; Baroud, Hiba; Vorobeychik, Yevgeniy; Kochenderfer, Mykel; Dubey, Abhishek (February 2022, Accident Analysis & Prevention)

Full Text Available
Global optimization of objective functions represented by ReLU networks

https://doi.org/10.1007/s10994-021-06050-2

Strong, Christopher A.; Wu, Haoze; Zeljić, Aleksandar; Julian, Kyle D.; Katz, Guy; Barrett, Clark; Kochenderfer, Mykel J. (October 2021, Machine Learning)

Neural networks can learn complex, non-convex functions, and it is challenging to guarantee their correct behavior in safety-critical contexts. Many approaches exist to find failures in networks (e.g., adversarial examples), but these cannot guarantee the absence of failures. Verification algorithms address this need and provide formal guarantees about a neural network by answering "yes or no" questions. For example, they can answer whether a violation exists within certain bounds. However, individual "yes or no" questions cannot answer qualitative questions such as "what is the largest error within these bounds"; the answers to these lie in the domain of optimization. Therefore, we propose strategies to extend existing verifiers to perform optimization and find: (i) the most extreme failure in a given input region and (ii) the minimum input perturbation required to cause a failure. A naive approach using a bisection search with an off-the-shelf verifier results in many expensive and overlapping calls to the verifier. Instead, we propose an approach that tightly integrates the optimization process into the verification procedure, achieving better runtime performance than the naive approach. We evaluate our approach implemented as an extension of Marabou, a state-of-the-art neural network verifier, and compare its performance with the bisection approach and MIPVerify, an optimization-based verifier. We observe complementary performance between our extension of Marabou and MIPVerify.
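The naive baseline the abstract mentions, bisection over repeated yes/no queries, can be sketched as below. This is only an illustration under stated assumptions: `bisection_max` and `make_grid_oracle` are hypothetical names, and the brute-force grid check stands in for a real verifier; it is not the Marabou or MIPVerify API and gives no formal guarantee.

```python
def relu(x):
    return x if x > 0.0 else 0.0


def tiny_network(x):
    # A hand-written one-input ReLU function used as a toy "network".
    return relu(x) - 2.0 * relu(x - 1.0)


def make_grid_oracle(f, points):
    """Return a yes/no oracle: does any sampled input give f(x) > threshold?

    A grid check is an assumption standing in for a sound verifier.
    """
    def exists_violation(threshold):
        return any(f(x) > threshold for x in points)
    return exists_violation


def bisection_max(exists_violation, lo, hi, tol=1e-3):
    """Approximate the largest output using only yes/no queries.

    Assumes the oracle answers True at lo and False at hi.
    Returns the estimate and the number of oracle calls made.
    """
    calls = 0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        calls += 1
        if exists_violation(mid):
            lo = mid  # some output exceeds mid, so the maximum lies above it
        else:
            hi = mid  # no output exceeds mid, so the maximum is at most mid
    return lo, calls


grid = [i / 100.0 for i in range(-200, 301)]  # inputs in [-2, 3]
oracle = make_grid_oracle(tiny_network, grid)
estimate, num_calls = bisection_max(oracle, lo=-1.0, hi=2.0)
print(estimate, num_calls)
```

Every iteration is a separate full query over the same input region, which illustrates why the abstract describes the bisection approach as making many expensive and overlapping verifier calls.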
Full Text Available
Efficient Large-Scale Multi-Drone Delivery using Transit Networks

https://doi.org/10.1613/jair.1.12450

Choudhury, Shushman; Solovey, Kiril; Kochenderfer, Mykel J.; Pavone, Marco (January 2021, Journal of Artificial Intelligence Research)

We consider the problem of routing a large fleet of drones to deliver packages simultaneously across broad urban areas. Besides flying directly, drones can use public transit vehicles such as buses and trams as temporary modes of transportation to conserve energy. Adding this capability to our formulation augments effective drone travel range and the space of possible deliveries but also increases problem input size due to the large transit networks. We present a comprehensive algorithmic framework that strives to minimize the maximum time to complete any delivery and addresses the multifaceted computational challenges of our problem through a two-layer approach. First, the upper layer assigns drones to package delivery sequences with an approximately optimal polynomial time allocation algorithm. Then, the lower layer executes the allocation by periodically routing the fleet over the transit network, using efficient, bounded suboptimal multi-agent pathfinding techniques tailored to our setting. We demonstrate the efficiency of our approach on simulations with up to 200 drones, 5000 packages, and transit networks with up to 8000 stops in San Francisco and the Washington DC Metropolitan Area. Our framework computes solutions for most settings within a few seconds on commodity hardware and enables drones to extend their effective range by a factor of nearly four using transit.
Full Text Available
Hierarchical planning for resource allocation in emergency response systems

https://doi.org/10.1145/3450267.3450543

Pettet, Geoffrey; Mukhopadhyay, Ayan; Kochenderfer, Mykel J.; Dubey, Abhishek (January 2021, 12th ACM/IEEE International Conference on Cyber-Physical Systems)
A classical problem in city-scale cyber-physical systems (CPS) is resource allocation under uncertainty. Spatial-temporal allocation of resources is optimized to allocate electric scooters across urban areas, place charging stations for vehicles, and design efficient on-demand transit. Typically, such problems are modeled as Markov (or semi-Markov) decision processes. While online, offline, and decentralized methodologies have been used to tackle such problems, none of the approaches scale well for large-scale decision problems. We create a general approach to hierarchical planning that leverages structure in city-level CPS problems to tackle resource allocation under uncertainty. We use emergency response as a case study and show how a large resource allocation problem can be split into smaller problems. We then create a principled framework for solving the smaller problems and tackling the interaction between them. Finally, we use real-world data from a major metropolitan area in the United States to validate our approach. Our experiments show that the proposed approach outperforms state-of-the-art approaches used in the field of emergency response.
Full Text Available
Active Preference-Based Gaussian Process Regression for Reward Learning

https://doi.org/10.15607/rss.2020.xvi.041

Biyik, Erdem; Huynh, Nicolas; Kochenderfer, Mykel; Sadigh, Dorsa (July 2020, Robotics: Science and Systems)
Full Text Available
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal Constraints

https://doi.org/10.15607/rss.2020.xvi.068

Choudhury, Shushman; Gupta, Jayesh; Kochenderfer, Mykel; Sadigh, Dorsa; Bohg, Jeannette (July 2020, Robotics: Science and Systems)
Full Text Available
